Finnish Native Language Identification

نویسندگان

  • Shervin Malmasi
  • Mark Dras
چکیده

We outline the first application of Native Language Identification (NLI) to Finnish learner data. NLI is the task of predicting an author’s first language using writings in an acquired language. Using data from a new learner corpus of Finnish — a language typology quite different from others previously investigated, with its morphological richness potentially causing difficulties — we show that a combination of three feature types is useful for this task. Our system achieves an accuracy of 70% against a baseline of 20% for predicting an author’s L1. Using the same features we can also distinguish non-native writings with an accuracy of 97%. This methodology can be useful for studying language transfer effects, developing teaching materials tailored to students’ native language and also forensic linguistics.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Phonetic training and non-native speech perception--New memory traces evolve in just three days as indexed by the mismatch negativity (MMN) and behavioural measures.

Language-specific, automatically responding memory traces form the basis for speech sound perception and new neural representations can also evolve for non-native speech categories. The aim of this study was to find out how a three-day phonetic listen-and-repeat training affects speech perception, and whether it generates new memory traces. We used behavioural identification, goodness rating, d...

متن کامل

L2 development of quantity perception: dutch listeners learning Finnish /t-t: /

The perceptual development of Dutch listeners learning to perceive the Finnish quantity contrast /t-t / was studied. It is shown that short laboratory training is (i) sufficient to change identification of relevant speech sounds, but (ii) insufficient to substantially change perceptual sensitivity along the phoneme continuum. Furthermore, L2 learners need much more relevant language experience,...

متن کامل

The Association between Patient-Reported Pain and Doctors' Language Proficiency in Clinical Practice

Patients' limited literacy and language fluency of different kinds cause them problems in navigating the medical interview. However, it is not known how physicians' native language skills affect the reported intensity of pain among Finnish emergency patients. Data were collected with two consecutive questionnaires in 16 healthcare centres and outpatient departments along the Finnish coast. Swed...

متن کامل

Musical Sophistication and the Effect of Complexity on Auditory Discrimination in Finnish Speakers

Musical experiences and native language are both known to affect auditory processing. The present work aims to disentangle the influences of native language phonology and musicality on behavioral and subcortical sound feature processing in a population of musically diverse Finnish speakers as well as to investigate the specificity of enhancement from musical training. Finnish speakers are highl...

متن کامل

Lexical Adaptation to a Novel Accent in German: A Comparison Between German, Swedish, and Finnish Listeners

Listeners usually adjust rapidly to unfamiliar regional and foreign accents in their native (L1) language. Non-native (L2) listeners, however, usually struggle when confronted with unfamiliar accents in their non-native language. The present study asks how native language background of L2 speakers influences lexical adjustments in a novel accent of German, in which several vowels were systemati...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014